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4.1 Introduction 

to LTe^ fS" 00 ?^ ° f thousands of microsatel- 

M-J . IT ! n2ed and alu Sh- rcso,u ^ genetic linkage map of the human genome 

SS^sssrir:?? key ,oo,s in tracking «« ^a, -d 

research laboratones. Short tandem repeat loci (STRs) are used in forties, identity testing 
SutlTS? 0f i° PUlatl0 , n StrUCtUrc «* wiU inc ^ - the Human Wversity projS 
S^fa^^T?" r h abUndaR£ ' ^ ^'Phicand technically very simpfetoT 
lL SSJ? 3 ? ** microsateI,ites ^ revolutionized genetic analysis. 

h fl c? polymorphism, to be exploited in genetic linkage studies were single 

sZ Z^T* V ? Mi0nS WhiCh ^""^ With5n endonuclease reckon 

i^SS^ '^rphisms could be easily detected because variant sequences either aeated 

SfSSTSST Tf"' ti0n SkeS ^ «■»*« in -triction foments of vari- 

?„!l g ?" Ct * pn,pOSed ""Stating genetic maps using restriction fragment 

lenf r^°? ,S ^K (RH ^ S) - Alth0Ugh ««* « widely distributed througKe 
SbC^ * ,S ""^ ^ l0W informativen ^- Since most RFLPs are only dimorphic 
wnen hi ST V *» nmimum heterozygosity of 50% can only occur 

JSet ^ repieSented in 3 Wlation-most RFLPs have lower heterozy- 

char^t n ri^ y J ith ^^'oP^t of RFLPs, another class of DNA polymorphism was 
fSSSS^J" ^ ° f «P««-" Tandem sequence repetition 

One t™ ^ gen ° meS ^ ^ ^ of re P^t motif have been described, 

terir T ? 3 ? re 0fre P et,tive se£ l uenc e loci is that the number of repeat units differs 
Se nntr i ^ * ° f ***** ^ Polymorphic markers based on 

vunable numbers of tandem repeats (VNTRs) are potentially very informative because of the 

naver^' rt Whk * ^ m0Sl PO^rphic VNTRs ("minisntdlites") 

3kh ?T , betWee " 12 ^ 60 ° r raorc base -P J,ire 31,11 a 'Ota! array size of 0.5 to over 
eres n!? 6 T r^" 0 " ° f minisate,litc VNTRs is that they tend to be clustered at telom- 
eres and are therefore of restricted value in constructing complete human genome maps * 

two El. 7 „ a ° f repetUive '° ci Were described WI 'th a rep** unit of only 

«vo base pa.rs-so-called -microliter.*-" I, was not. however, until 1989 that the poly- 
morphs- nature of rmcrosatellites was recognized.*" As with larger VNTRs, microsatellites 
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vary between individuals in the number of repeats in the array. Their nomenclature is infor- 

?vLZ / ° C ' Vari ° USly referred IO aS STRs ' variable sma " s ^uence markers 
(VSSMs). simple sequence repeats (SSRs). dinucleotide repeats. CA blocks etc. The repeat 

aTa m^^JT. ' '° 6 b f. a " d thC m ° St COmmon n » ic ^atellite repeat motifs are A. AC 
AAAN, AAN. AG. and AT" although the best characterized are dinucleotide (dC-dA/dG-dT) 
repeats. Microsatellites are extremely abundant, occurring with an estimated average fre- 
quency of one STR every 6 kb of human genomic sequence." Microsatellites have clear 
advantages over the other polymorphisms described above. STRs often have multiple alleles 
and many have heterozygosity frequencies of 70% or more makins them highly informative 
tor genenc analysis. In addition, the loci are small enough to be analyzed using the poly- 
merase cham reaction (PCR).' 1 -'' The significance of these factors was quickly recognized 
and microsatellites soon became markers of choice for many applications. 

4.1.1 Informativeness of Microsatellites 

The informativeness of a polymorphic marker depends upon the number of alleles and their 
relative population frequencies. In the context of genetic linkage studies (for example, predictive 
linkage analysis in a family with a genetic disease), the informativeness of a linked marker relates 
to the Ukelihood that the parental genotypes can be deduced following analysis of a child of an 
affected parent. Botstein et al. 1 described the polymorphism information content (PIC) which is 
a statistical assessment of informativeness of a marker. In order to evaluate a marker for PIC 
firsdy the frequencies of all possible genotypes for a given marker in a population and the fre- 
quencies of all mating-type combinations are estimated. Next, the probability of informativeness 
in offspring of each mating-type combination is calculated. Finally, a value for PIC is obtained 
by summing the mating-type frequencies multiplied by the probability of informative offspring. 

Marker informativeness is more easily estimated by simply counting the number of het- 
erozygotes in a suitably large sample set PIC approximates to the observed frequency of het- 
erozygosity. The greater the number of alleles at a given locus (and the more even the spread of 
allele frequencies in a population), the more informative will be the marker. This underlies the 
virtue of microsatellites in linkage analysis and gives measure to the extent to which microsatel- 
lites are much more informative than dimorphic systems such as RFLPs. 

4.2 Applications 

4.2. 1 Construction of Genetic Maps 

Genetic maps are constructed by linkage analysis. Linkage relationships (map order and 
distance between markers) are established by typing a collection of families with the markers 
of interest. Mapping information is obtained by detecting recombination between markers. 
Linkage analysis has been successful in mapping genes for a great number of inherited condi- 
tions as the first step in a positional cloning strategy. The disease itself is treated as polymor- 
phic marker with alleles "mutant" and "normal." This clearly relies upon accurate clinical 
assessment of recombinant individuals in affected families. 

4.2.2 Disease Gene Tracking 

Clinical molecular genetics laboratories make use of linked marker to perform predictive 
or presymptomatic testing for at-risk individuals in affected families. Disease genes are 
tracked through families by analyzing inheritance of markers known to be closely-linked to 
the disease. Figure 4. 1 shows an example of use of a STR to track an autosomal dominant, late 



